Corpus: eng_news_2005_100K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 92 99 99 99 99
1000 782 951 994 997 998
10000 5100 8803 9785 9950 9990
100000 23885 70416 90858 97674 99243
1000000 23885 70416 90859 97675 99244


Zipf's diagram for sentence endings


Gnuplot diagram

5930 msec needed at 2018-02-24 18:23